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Abstract 

We investigate statistical inference across time scales. We take as toy 
model the estimation of the intensity of a discretely observed compound 
Poisson process with symmetric Bernoulli jumps. We have data at times 
iA for i = 0, 1, . . . over [0, T], for different sizes of A = At relative to T in 
the limit T — > oo. We quantify the smooth statistical transition from a mi- 
croscopic Poissonian regime (A T — »■ 0) to a macroscopic Gaussian regime 
(At — > oo). The classical quadratic variation estimator is efficient in both 
microscopic and macroscopic scales but surprisingly shows a substantial 
loss of information in the intermediate scale At — > Aoo e (0, oo) that can 
be explicitly related to A^. We discuss the implications of these findings 
beyond this idealised framework. 

Keywords: Discretely observed random process, LAN property, Information 
loss. 
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1 Introduction 

1.1 Motivation 

We specialise in this paper on the example of a discretely observed compound 
Poisson process with symmetric Bernoulli jumps. This toy model is central to 
several application fields, e.g. financial econometrics or traffic networks (see the 
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discussion in Section 3 and the references therein). Moreover, it already con- 
tains several interesting properties that enlight a tentative concept of statistical 
inference across scales. Consider a 1-dimensional random process (Xt) defined 
by 

Nt 

X t = X + Y,^, t>0, (1.1) 
i=i 

where the £j S { — 1, 1} are independent, identically distributed with 

P( e< = -l)=P( ei = l) = i 

and independent of the standard homogeneous Poisson process (Nt) with in- 
tensity $ G © = (0,oo). Suppose we have discrete data over [0, T] at times iA. 
This means that we observe 

X = (Xq,Xa, ■ ■ • ^[TA- 1 ]); (1-2) 

and we obtain a statistical experiment by taking P# as the law of X defined by 
(1.2) when (Xt) is governed by (1.1). 

On the one hand, if we observe (Xt) microscopically, that is if A = At — > 
as T — > oo, then asymptotically, we can - essentially - locate the jumps of (Nt) 
that convey all the relevant information about the parameter 

In that case, X is "close" to the continuous path (Xt,t 6 [0, T]). On the 
other hand, if we observe (Xt) macro scopically, that is if At — > oo under the 
constraint 1 T j At — > oo , we have a completely different picture: the diffusive 
approximation 

XiA T ~ X (i _ 1)AT « v^A7(^A T - W {i _ 1)Ar ), (1.3) 

becomes valid, where (Wt) is a standard Wiener process. Inference on i? es- 
sentially transfers into a Gaussian variance estimation problem; in that case, 
the state space rather becomes J+i^ pj na j^y if we observe (Xt) in the 

intermediate scale < liminf At < limsupAy < oo, we observe a process 
presenting too many jumps to be located accurately from the data, and too 
few to verify the Gaussian approximation (1.3). Therefore, depending on the 
scale parameter At, the state space may vary, and it has an impact on the 
underlying random scenarios P$, although the interpretation of the parameter 

This condition ensures that asymptotically infinitely many observations are recorded in 
the limit T — > oo. 
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of interest $ remains the same at all scales. What we have is rather a family of 
experiments 

£ T ' A = {Pj' A ,tfe9}, (1.4) 

where Pp A denotes the law of X given by (1.2) and these experiments £ T '^ 
may exhibit different behaviours at different scales A. Heuristically, we would 
like to state that in the microscopic scale At — > 0, the measure PV T conveys 
the same information about ■& as the law of 

(N t , t G [0, T]), (1.5) 

that is if the jump times of (Xt) were observed. On the other side, in the 
macroscopic scale At — > oo with T j At — > oo, the measure P^' At shall convey 
the same information about i? as the law of 

(0,V^W AT) ...,v^W LrA -ij), (1.6) 

that is if the data were drawn as a Brownian diffusion with variance i?. The 
following questions naturally arise: 

i) How does the model formulated in (1.4) interpolate - from a statisti- 
cal inference perspective - from microscopic (when A = At —> 0) to 
macroscopic scales (when A = At —> oo)? In particular, how do intrinsic 
statistical information indices (such as the Fisher information) evolve as 
A = At varies? 

ii) Is there any nontrivial phenomenon that occurs in the intermediate regime 

< liminf At < limsupAy < oo? 

iii) Given i) and ii), if a statistical procedure is optimal on a given scale A, 
how does it perform on another scale? Is it possible to construct a single 
procedure that automatically adapts to each scale A, in the sense that it 
is efficient simultaneously over different time scales? 

1.2 Main results 

In this paper, we systematically explore questions i), ii) and iii) in the simplified 
context of the experiments £ T ' A built upon the continuous time random walks 
model (1.1) for transparency. Some extensions to non-homogeneous compound 
Poisson processes are given, and the generalisation to a more general compound 
law is also discussed. As for i), we prove in Theorems 2.2, 2.4 and 2.5 that the 
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LAN condition (Locally Asymptotic Normality 2 ) holds for all scales A. This 
means that can be approximated - in appropriate sense - by the law of 
a Gaussian shift. We derive in particular the Fisher information of £ T,A and 
observe that it smoothly depends on the scale A. We shall see that the answer to 
ii) is positive. More precisely, we first prove in Theorem 2.6 that the normalised 
quadratic variation estimator 

1 LTA?lj 

$T V = f E {XiA T ~ ^(i-l)A T ) 
i=l 

is asymptotically efficient - it is asymptotically normal and its asymptotic vari- 
ance is equivalent to the inverse of the Fisher information - in both microscopic 
and macroscopic regimes. In the microscopic regime, it stems from the fact that 
the approximation 

0<t<T 

becomes valid, as the jumps are ±1, and the efficiency is then a consequence 
of Nt/T being the maximum likelihood estimator in the approximation exper- 
iment (1.5). In the macroscopic regime, thanks to the diffusive approximation 
(1.3) we have 

[TA _1 J 

W*^ E (^(^a t -^ (1 _ 1)At )) 2 , 
i=i 

which is precisely the maximum likelihood estimator in the macroscopic ap- 
proximation experiment (1.6). Surprisingly, fails to be efficient when 

At — > Aqo G (0, oo). (1.7) 

More precisely, we show in Theorem 2.7 that, although rate optimal, dj, misses 
the optimal variance by a non-negligible factor, depending on Aoo, that can 
reach up to 23%. This phenomenon is due to the fact that in the intermediate 
regime (1.7), the process (Xt) is sampled at a rate which has the same order 
as the intensity of its jumps. On the one hand, (X»a t — ^(«-i)A T ) 2 gives no 
accurate information whereas a jump has occured or not during the period 
[(i — 1)At,{Ax], contrary to the case At — > 0. On the other hand, there 

2 Recommended references are the textbooks [8] and [14], but we recall some definitions in 
Section 2.2 for sake of completeness. 
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are not enough jumps to validate the approximation of Xi Ar — X(i-i)A T by a 
Gaussian random variable, contrary to the case At — > oo. Finally, we construct 
in Theorem 2.9 a one-step correction of that provides an estimator efficient 
in all scales, giving a positive answer to iii). 

The paper is organised as follows. We first propose in Section 2.1 a canoni- 
cal framework for different time scales by considering the family of experiments 
^£T,A T ^ The way the scale parameter depends on T defines the terms mi- 
croscopic, intermediate and macroscopic scales rigorously. Specialising to model 
(1.1) for transparency, the results about the structure of the corresponding 
{£ TAt ) t>0 are stated in Section 2.2. We show in Theorems 2.2, 2.4 and 2.5 
that the LAN (Local Asymptotic Normality) property holds simultaneously 
over all scales and provides an explicit expression for the Fisher information. 
The proof follows the classical route of [8] and boils down to obtaining accurate 
approximations of the distribution 

f AT (0, k) = Fl AT (X iAt - X (i _ 1)AT = k), k€Z, 

in the limit Ay — > or oo. Note that /a t C#, k) does not depend on i since (X t ) 
has stationary increments. However explicit, the intricate form of /a t ($> k) re- 
quires asymptotic expansions of modified Bessel functions of the first kind. In 
the macroscopic regime however, we were not able to obtain such expansions. 
We take another route instead, proving directly the asymptotic equivalence in 
the Le Cam sense, a stronger result at the expense of requiring a rate of conver- 
gence of Ay to oo, presumably superfluous. We show in Theorems 2.6 and 2.7 
of Section 2.3 that the quadratic variation estimator #2 is rate optimal and 
efficient in both microscopic and macroscopic regimes, but not in the intermedi- 
ate scales (1.7). This negative result is however appended with the construction 
of an adaptive estimator, constructed by a standard one-step correction of $p 
based on the likelihood at intermediate scales, and efficient over all scales (The- 
orem 2.9). Moreover this estimator has the advantage of being computationally 
implementable, contrary to the theoretical optimal maximum likelihood estima- 
tor. Section 3 gives some extensions in the case of a non-homogeneous compound 
Poisson process (Theorem 3.2) and addresses the generalisation to more general 
compound laws. The comparison to related works on estimating Levy processes 
from discrete data is also discussed. Section 4 is devoted to the proofs. 
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2 Statement of the results 



2.1 Building up statistical experiments across time scales 

Let T > and A > be such that A < T. On a rich enough probability 
space (Q, T, P), we observe the process (Xt) defined in (1.1) at frequency A -1 
over the period [0, T]. Thus we observe X defined in (1.2), and with no loss of 
generality 3 , we take Xq = 0. We obtain a family of statistical experiments 

£ T A := ( Z L^- 1 J,P(Z^ 1 J),{PJ A ,,9 e 8}), 

where P»' denotes the law of X when (Xt) has the form (1.1), and C (0, oo) 
is a parameter set with non empty interior. The experiment £ T ' A is dominated 
by the counting measure ht on iX^ T K Abusing notation slightly, we may 1 
(and will) identify X with the canonical observation in £ T ' A . Since (Xt) has 
stationary and independent increments under P^' , we obtain the following 
expression for the likelihood 

— *—(X)= H fA(#,X iA -X (i „ 1)A ), 

where we have set, for k E Z, 

U(#, k) := P^ A (X iA - X (i _ 1)A = fc) = P^ A (X A = k). (2.1) 

We shall repeatedly use the terms microscopic, intermediate and macroscopic 
scale (or regime). In order to define these terms precisely, we let A = A^ depend 
on T with < A^ < T, and we adopt the following terminology. 

Definition 2.1. The sub-family of experiments (£ T ' At )t>o is sa id to be 

1. On a microscopic scale (or regime) if At — > as T — >■ oo. 

2. On an intermediate scale (or regime) if At — > Aoo as T — > oo, for some 
Aoo E (0,oo). 

3. On a macroscopic scale (or regime) if At — > oo and T/At oo as 
T ^ oo. 



3 By assuming Xo = 0, the first data point does not give information about the parameter 
i9. If only asymptotic properties of the statistical model are studied, which is always the case 
here, it has no effect. 

4 By taking for instance fi = Z [ta t 1 J . 
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2.2 The regularity of {8 > t )t>o across time scales 

Let us recall 5 that the family of experiments (£ T ' At )t>o satisfies the Local 
Asymptotic Normality (LAN) property at point i9 G with normalisation 
ItAt(^) > if, for every » £l such that # + f/T,A T (^) 1//2 G 0, the following 
decomposition holds 

i mTAr 

— ^ (X) =exp« T - -u +r T ), (2.2) 



where 
and 



£r -> A/"(0, 1) in distribution under Pp Ar as T -»• oo (2.3) 

r t — > in probability under PV T as T — )■ oo. (2-4) 

If (2.2), (2.3) and (2.4) hold, we informally say that (£ T '^ t )t>o is regular with 
information It,a t ($)- This means that locally around the law of X can be 
approximated by the law of a Gaussian shift experiment, where one observes a 
single random variable 

Y = $ + ItaA$)~ 1/2 Zt, 

with £r being approximately distributed as a standard Gaussian random vari- 
able under P^' T as T — > oo. In particular, the optimal rate of convergence for 
recovering $ from X is the same as the one obtained from Y. It is given by 
It,At($) provided ir,A T ($) — > co as T — > oo. Note also that if the convergence 
of the remainder term = rjn(v) in (2.4) holds locally uniformly in v, then 
Ir,A T (^) can De replaced by any function JtAt($) such that 

Jt,a t ($) ~ It, At ($) as T — ^ oo 

without affecting the LAN property. Hereafter, the symbol ~ means asymptotic 
equivalence up to constants. Our first result states the LAN property for the 
experiment {£ T ' A ) T>0 on every scale A G (0, oo). 

Theorem 2.2 (The intermediate regime). Assume At —> Aqo G (0, oo) as 
T — >• oo. T/ien f/ie family (£ t,At )t>o is regular and we have 

ItaM -ItaooW = ta 00 (eJ a - [(/ IAoc (^,x Aoo )-(^a 00 )- 1 |Aa 00 |-i) 2 



3 See for instance the textbooks [8] or [14]. 
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with 

h Aoo (#,k) - 



where, for x£l and i/GN, 

_ v (s/2)*"+» 

m>0 v 7 

denotes the modified Bessel function of the first kind. 

Remark 2.3. By taking At = A ro 6 (0, oo) constant, we include the case of a 
fixed A, therefore the same regularity result holds for {£ T '^)t>o' An inspection 
of the proof of Theorem 2.2 reveals that the mapping A It a($) is continuous 
over (0, oo). 

Our next result shows that formally we can let Aoo — > in the expression 
of ir.AooC^) given by Theorem 2.2 in the microscopic case. Moreover we obtain 
a simplified expression for the information rate. 

Theorem 2.4 (The microscopic case). Assume At — > as T — >• oo. Then the 
family {£ T '^ t )t>o is regular and we have 

ItAtW) ~ bflW) ■= lim /t,a(i?) = t- 

A— >0 77 

The macroscopic case is a bit more involved. In that case, we cannot formally 
let Aqo — > oo in the expression of /r,A 00 ('i?) given by Theorem 2.2. However we 
have the following simplification. 

Theorem 2.5 (The macroscopic case). Assume At —> oo, T j At — > oo as 

T -> oo and T/A T + ~ A = o((log(T/A T ))~i) . T/ien £/te family (£ TAt )t>o is 
regular and we have 

TA -1 

The condition T/A T 4 = o((log(T/Ar))~3) is technical but quite strin- 
gent; it stems from our method of proof, see Section 4.3. It is presumably 
superfluous, but we do not know how to relax it. 
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2.3 The distortion of information across time scales 



On each scale A > 0, let us introduce the empirical quadratic variation estima- 
tor 

L TA-1 J 

i=l 

that mimics the behaviour of the maximum likelihood estimator in both macro- 
scopic and microscopic regimes (see Section 1). More precisely, we have the 
following asymptotic normality result. 

Theorem 2.6. Let A = A^ > be such that T/At — > oo as T -4 oo. We have 

d TA T =$+ feW- 1 + /T,oo (tf)- 1 ) 172 ^, 

where £t -4 A/"(0, 1) in distribution under IP$' A , and It,o($) an d It,oo ($) are the 
information of the microscopic and macroscopic experiments given in Theorems 
2.4 and 2.5 respectively. 

On a microscopic scale Ay — > 0, we have 

- — -—- ->0 as T -> 00. 
It,o{v) 1 

On a macroscopic scale At — > 00 with T/Ay — >■ 00, we have on the contrary 



as T — > 00. 



As a consequence, we readily see that "#7? ^ T is asymptotically normal and that 
its asymptotic variance is equivalent to It,o($) on a microscopic scale and to 
It,oo('&)~ 1 on a macroscopic scale. At a heuristical level, this phenomenon can 
be explained directly by the form of the empirical quadratic variation estimator, 
as we already did in Section 1. At intermediate scales however, this is no longer 
true. 

Theorem 2.7 (Loss of efficiency in the intermediate regime). Assume that 

< lim inf A T < lim sup A T < — . (2.5) 

Then 

BmM fr J ,w-'+/T.-W- >1| 

where It,A t ($) is defined in Theorem 2.2. 
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Remark 2.8. For technical reasons, we are unable to prove that Theorem 2.7 
remains valid beyond the restriction lim sup Ay < 1/ (4$) . Numerical simula- 
tions suggest however that Theorem 2.7 is valid whenever lim sup Ay < oo, see 
Figure 2.3. 

Let us denote by 

the squared error loss of the quadratic variation estimator. By Theorems 2.2, 
2.4 and 2.5, the family (£ T ' At )j> >0 is regular in all regimes and we may apply 
the classical minimax lower bound of Hajek, see for instance Theorem 12.1 in 
[8]: we have, for any $o £ ® an d 5 > such that [i9q — $, $o + $] C O 

liminf sup / t ,a t (i9)^t(^ta >#) ^ L ( 2 - 6 ) 

r^oo |^_ tfo |<,5 ' ' T 

On the one hand, Theorem 2.6 suggests 6 that the lower bound (2.6) can be 
achieved in microscopic and macroscopic regimes. On the other hand, Theorem 
2.7 shows that inequality (2.6) is strict in the intermediate case, whenever the 
restriction (2.5) is satisfied, thus revealing a loss of efficiency in this sense. 
Define 

<p(0, A) = E^ A [{h A (#,X A ) - (tfA)- 1 ^! - l) 2 ], 

where h A ('&, k) is defined in Theorem 2.2. An inspection of the proof of Theorem 
2.7 shows that tpifi, A) = ?/>($A), for some univariate function ijj, and that 

lTfli X A \^ rl = ^ a)2 + ^ ■ 

The maximal loss of information is obtained for 

A*(tf) ~ ^~ 1 axgmax x>0 ip(x)(2x 2 + x) 

as T — > oo. Numerical simulations show that the maximum loss of efficiency is 
close to 23%. Since {£ T ' A ) T>0 is regular for every A > 0, an asymptotically 
normal estimator with asymptotic variance equivalent to /t,a($) -1 is given by 
the maximum likelihood estimator. However due to the absence of a closed-form 
for the likelihood ratio that involves the intricate function /a defined in 
(2.1) (see also Section 4.1.1), it seems easier to start from i?^ A which is already 
rate-optimal by Theorem 2.6 and correct it by a classical one-step iteration 

This is actually true as the uniform integrability of $y At under P^' , locally uniformly 
in can easily be obtained. We leave the details to the reader. 
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L°0, M 



Figure 1: Deficiency ratio through scales (x-axis x = i?A, logarithmic scale). Ratio 
between the information It,a t ($) and the inverse of the variance of $® v At . The maximum is 
1.2297 and is attained at x = 0.600. 



based on the Newton-Rhapson method, see for instance the textbook [14] pp. 
71-75. To that end, define 



20S 

T, A 



QV 



EEi A?1J d^ogf A {d^ v A ,X iA - X {i _ 1)A ) 



El=f" J a|log/ A (^,X iA - X (l _ 1)A ) 



QV 



(2.7) 



Theorem 2.9. In all three regimes (microscopic, intermediate and macro- 
scopic), we have 



rV2 (2QS_ 



'V'.At (^T,A 



i?) —^(0,1) 



i J' At -distribution as T — )■ oo. 



Proof. In essence the regularity of f A {y^ A ,Xi A — Xu^^j enables to apply 
Theorem 5.45 of Van der Vaart [14]. □ 



Theorem 2.9 expresses the fact that automatically adapts to It,a 7 

and is therefore optimal across scales. 



3 Discussion 



The compound Poisson process (Xt) with Bernoulli symmetric jumps defined 
in (1.1) is the simplest model of a continuous time symmetric random walk on a 
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Log 10 (A| 



Figure 2: Information deficiency through scales (# = 1, a>axis A, logarithmic 
scale). Information It,a t (solid black). Inverse of the variance of $y^ T , theoretical (solid 
grey) and empirical (star green). The inverse of the empirical variance 

of #t,a t (dotted blue) 

is close to the optimal It,a t on all scales. The empirical variances were computed using 10 4 
Monte-Carlo simulations from T/A = 10 5 data, i.e. with the same number of data through 
scales. 



lattice that diffuses to a Brownian motion on a macroscopic scale. The intensity 
{) of the Poisson arrivals on a microscopic scale is transferred into the variance 
$ of the Brownian motion on a macroscopic scale: 

(^XtT, t G [0, 1]) — ► (VtiWt, t G [0, 1]) (3.1) 

in distribution as T — > oo, where (Wj) is a standard Brownian motion. The 
statistical inference program we have developed across time scales on the toy 
model given by (Xt) can be useful in several applied fields. For instance, in 
financial econometrics, (Xt) may be viewed as a toy model for a price process 
(last traded price, mid-price or bets bid/ask price) observed at the level of the 
order book, see e.g. [2] or [9]. The parameter i? can be interpreted as a trading 
intensity on microscopic scales that transfers into a macroscopic volatility in the 
diffusion regimes. Our results convey the message that if a practitioner samples 
(Xt) at high frequency at the same rate as price changes, which is customary 
in practice, then the realised volatility estimator ^ T is not efficient, and a 

modified estimator like i?^ t should be used instead. However, this framework 
is a bit too simple and needs to be generalised in order to be more realistic in 
practice. Two directions can be explored in a relatively straightforward manner: 
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i) The extension to a non-homogeneous intensity Poisson process. 

ii) The extension to an arbitrary compound law on a discrete lattice. 

Extension to the non-homogeneous case 

Theorems 2.2, 2.4 and 2.5 extend to the non-homogeneous case, when one allows 
the intensity of the jumps to depend on time. In this setting, the counting 
process (N t ) defined in (1.1) is defined on [0,T] and has intensity 

A T (f,<i?)= f A(i9, j,)ds, fortG[0,T] 
Jo 

where 

A: X [0,1] -»■ (0,oo) 
is the nonvanishing (integrable) intensity function, so that the process 

{N t -A T (t,0),te [0,T]) 

is a martingale. The homogenous case is recovered by setting A(i9,i) = $ for 
every t G [0, 1]. In this context, the macroscopic approximation (3.1) becomes 




in distribution as T — > oo. We state - without proof - an extension of Theorems 
2.2, 2.4 and 2.5 for the associated family of experiments (£ t,At ) t>q across 
scales. 

Assumption 3.1. We have that l? ~» A(i9,t) is continuously differentiate for 
almost all t £ [0, 1] and moreover sup# e e )te r 0) i] A($, t) < oo. 

Theorem 3.2. We have Theorems 2.2, 2.4 and 2.5 with the following gener- 
alisation 

1. In the microscopic case At — > 0, 

ItM = T / (dolog\({>, s)) 2 \(v),s)ds. 
Jo 

2. In the intermediate regime At — > Aoo £ (0, oo), 

/•l 

/ T (t?) = TA 00 / (^logA^s))^^^) 2 ^^^)^, 

JO 
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with 

ff(tf, S )=Ej' A ~ [(h Aoo (#,s,X A J + (\(V,s)A oo y 1 \X±J-l) 2 ], 
and 

U / n 7 \ X| fc | + 1 (A(^,S)A 00 ) 

l+i 



5. in i/ie macroscopic case At —> oo u;it/t T j At — > oo and T / A T 4 
((log(T/A T ))-i), 



T"i A — 1 />1 

It{#) = —^I (d»log \($,s)) 2 ds. 







The proof of Theorem 3.2 relies on the approximation 



;a t / i - 1 



A(#, = A T A tf, — T + A T r T , 
(i-l)A T v 1 A T ' 

for j = 1, . . . , [TA^ 1 ] , where tt — > as T — > oo in all three regimes. Assump- 
tion 3.1 ensures that the convergence of the remainder is uniform in i and 
This reduction enables us to transfer the problem of proving Theorems 2.2, 2.4 
and 2.5 when substituting independent identically distributed random variables 
by independent non-equally distributed ones. This is not essentially more diffi- 
cult, and the regularity of A enables us to piece together the local information 
given by each increment Xi& T — Xu_x)& T 111 or der to obtain the formulae of 
Theorem 3.2. 

An analogous program as in Section 2.3 for the distortion of information 
could presumably be carried over, with appropriate modifications. For instance, 
one can show that 

^TA T = ^2 ( XiA r ~ x (i-i)A T ) 2 — > X(&,s)ds asT-^oo 
i=i ^° 



in PV T -probability, in all three regimes. Then, in order to estimate i? efficiently, 
one should rather consider a contrast estimator that maximises 

LTA" 1 ] 

0~>U t ,a t (&)= Yl 9A T {K^,i^T),X iAT -X {i _ 1)AT ) 
i=i 

for a suitable function <7ta t ; an d make further assumptions on existence of a 
unique maximum for the limit - whenever it exists - of Ut,a t under P^' T as 
T — > oo. We do not pursue this here. 
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Extension to more general compound laws 

The situation is a bit more delicate when one tries to generalise Theorems 2.2, 
2.4 and 2.5 to an arbitrary compound law k) , k G Z), for every $ G 0, 
with 

O<C(0,A:)<1, for k G Z and J^C(0,fc) = l, 

(and £($, 0) = for obvious identifiability conditions). We then observe a pro- 
cess (Xt) of the form (1.1), except that the jumps (ej) are now distributed 
according to 

F(e,i = k) = C(0,fc), fc G Z. 

In order to keep up with the preceding case, we normalise the compound law, 
imposing 

J2k({'&,k) = and ^fc 2 C(M) = 1- (3-2) 

First, in the microscopic case, we approximately observe over the period [0, T] 
a random number of jumps, namely Nt which is of order -dT. Second, con- 
ditionally on Nt, the size of the jumps form a sequence of independent and 
identically distributed random variables with law k). On the other side, in 
the macroscopic limit, the effect of the size of the jumps is only tracked through 
their second moment, which is normalised to 1 by (3.2). Therefore it gives no 
additional information about The situation is rather different from the case 
of symmetric Bernoulli jumps: here, the extraneous information about i? lies in 
the effect of the jumps, which are recovered in the microscopic regime and lost 
in the macroscopic one. There is however one way to reconcile with our initial 
setting, assuming that the compound law ((k) does not depend on $ and is 
known for simplicity. Then, for k G Z, we have 

/a(0, k) = f>l A {X tA - X {l _ 1)A = k) = J2 C^W^-j-^A)" 1 , 

m£Z 

where (* m (k) is the probability that a random walk with law £(fc) started at 
reaches k in m steps exactly. Therefore 

f A (#,k) = e-< )A g k ($A), with 5 t (i)=VrW-. (3.3) 

L — ' ml 

meZ 

In the symmetric Bernoulli case, we have Qk{x) = Xifci(x), where T v (x) is the 
modified Bessel function of the first kind. Anticipating the proof of Theorems 
2.2, 2.4 and 2.5, analogous results could presumably be obtained for an arbitrary 
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compound law ((k) satisfying (3.2), provided accurate asymptotic expansions 
of Gk{x) are available in the viscinity of and oo. The same subsequent results 
about the distortion of information that are developed in Section 2.3 would pre- 
sumably follow, with the same estimators $ta ano - ^?A T > anc ^ the appropriate 
changes for /a($>&) in (2-7). 

Relation to other works 

Concerning the estimation of the law of the jumps, say we have an inverse 
problem. One tries to recover £ from the observations of a compound Poisson 
process, the link between £ and the law of the process being given by (3.3). In 
the setting of positive compound laws, Buchmann and Griibel [4, 5] succeed to 
invert that relation and give an estimator of £ in the discrete and continuous 
case. That method which consists in inverting (3.3) is called decompounding. It 
was generalised by Bogsted and Pitts [3] to renewal reward processes when the 
law of the holding times is known, inrestriction to the case of having positive 
jumps only. 

The compound Poisson process is a pure jump Levy process that can be 
studied accordingly. Using the Levy-Khintchine formula, it is possible to esti- 
mate nonparametrically its Levy measure which is given by the product $ x £ in 
that case. This strategy is exploited by van Es et al. [15] for a known intensity. 
This estimation procedure does not restrict to compound Poisson processes and 
it includes the case of pure jump Levy processes in general. Nonparametric esti- 
mation of the Levy measure from high frequency data (that corresponds to our 
microscopic case At — > 0) is thoroughly studied in Comte and Genon-Catalot 
[6] as well as in the intermediate regime (with At = Aoo fixed) in [7]. In that 
latter case, we also have the results of Neumann and Reifi [12]. 

4 Proofs 

4.1 Preparation 

4.1.1 Some estimates for /a(^j^) 

We have, for k £ Z: 

-i?A 

/a(#, k) = P^' A (X A = k) = Y J Vm(k)^-(#A) m 

m>0 



16 



where <p m (k) is the probability that a symmetric random walk in Z started from 
has value k after m steps exactly: 



if |A;| > m or \k\ — m is odd 

2~ m | , otherwise. 
^{m+\k\)J 



\2< 

Let us introduce the modified Bessel function of the first kind' 

w - E ,'fT + u - 

m>0 v ; 

for every x, v S R, and where 

/•+00 

r(a;)= / t x - x e- l dt 
Jo 

denotes the Gamma function. Straightforward computations show that 

/ A (tf,fc) = expH?A)X w (tfA). (4.1) 

See for instance [13], p. 21 Example 4.7. We gather some technically useful 
properties of the function T u (x) that we will repeatedly use in the sequel. 

Lemma 4.1. 1. For every x £ M\{0} and v € R, we have 

d x I v (x) = T v+1 {x) + -X v (x). (4.2) 

x 

2. For every fi > v > — \ and x > 0, we have 

l^x) <l u (x). (4.3) 

Proof. Property 1 can be found in the textbook of Watson [16] and readily 
follows from the fact that x I u (x) is analytical with an infinite radius of 
convergence. Property 2 is less obvious and follows from Nasell [11]. □ 



7 The function x I v {x) can also be denned as the solution to the differential equation 



2 d V d V i 2 
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4.1.2 The Fisher information of £ T ' A 

For i = 1, . . . , |_TA J , let £ i ' denote the experiment generated by the obser- 
vation of the incerement Xj A — -^"(j-i)A- Since (Xt) has independent stationary 
increments, we have, for k E Z 

Pl' A (X iA - X (i _ 1)A = fc) = Pj' A (X A = fc) = / A (0, k). 

Using that Xq = 0, it follows that 

ITA" 1 ] 

£ TA = (g) ^ A ( 4 - 4 ) 

i=i 



as a product of independent observations given by the increments Xj A — X( i _ 1 j A , 
each experiment being dominated by the counting measure on Z with 

density / A (i9, fc) given by (4.1) that does not depend on z. Moreover, ^ ' has 
(possibly infinite) Fisher information given by 



M#)=K AT [(^o g f A ^X A )f]=^ ^j^ < +oo 

which does not depend on i. We study the regularity of £ T ' A in the classical 
sense of Ibragimov and Hasminskii (see [8] p. 65). 

Definition 4.2. The experiment £ i ' is regular (in the sense of Ibragimov and 
Hasminskii) if 

i) The mapping •& ~» / A (i?, k) is continuous on for every k E Z. 

ii) T/ie Fisher information is finite: 3 A (i9) < +oo /or every t? E 0. 

iii) 27ie mapping i? 9^ (/ A (^ 5 •) 1 ^ 2 ) continuous in ^ 2 (Z). 
Lemma 4.3. T/ie experiments Sf'^ are regular. 

Proof. For every A; E Z, ,/ A (t?, &) = exp(— -#A)Z| fc | (#A), therefore i) is readily 
satisfied since i9 E C (0, oo) and A > 0. We also have fA($, k) > for every 
E Z, then 3 A (i?) is well defined, but possibly infinite. In order to prove ii), 
we write 

^log/ A (tf,X A ) = 3* log (e^ A Z| XA |(tfA)) - A + ^^f^ 

= A(/ iA ( ? ?,X A ) + (^A)- 1 |X A |-l), (4.5) 



2 
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where we have set, for every k G Z, 

^|fc|(^ A ) 

and used Property (4.2). It follows that 

J A (<?) = A 2 E^' A [(/ lA (i?,X A ) + (^A)- 1 |X A | -l) 2 ]. (4.6) 

Moreover the function |X A | X|x A |(i?A) is decreasing (see (4.3)), thus 

0</^ A (^,X A )<l (4.7) 

and since X A has all moments under , we obtain ii). We proceed similarly 
for iii). First, for any i? E and e such that $ + e G 0, we have 

^(A^ + e^) 1 / 2 )-^^^^) 1 / 2 ) =edl(f A (# £ ,k) 1 / 2 ) 

for some $ e 6 [$,# + £]. Second, we write 

Mf^,k)^ 2 ) = /A(tf £ ,A0 1/2 §^log/ A 0? e ,fc), 

and, differentiating a second time, we obtain that c^(/ A (# e , fc) 1 / 2 ) equals 

/a (t? e , fc) 1/2 ( ( I log / A (0 6 , k) ) 2 + \dl log / A (0 e , k) 



Therefore, taking square and summing in k, we derive 



e 2 ET' A 



^log/ A (^,X A )) 2 + i^log/ A (^,X A ))" . (4.8) 



From ii), we have 

d#logf A {#,X A ) = A(h A (#,X A ) + {&A)- 1 \X A \-l), (4.9) 



ji A 

and this last quantity has moments of all orders under Pv , locally uniformly 
in 1?. Likewise, using (4.9) and (4.2), it is easily seen that 

81 log f A (0, X A ) = A 2 %aI+2(^A) + M - lhA ^ ^ 

- A 2 /i A (tf,X A ) 2 -tf- 2 |X A |. 

Thus S 2 , log / A ($, X A ) has moments of all orders under P^' At locally uniformly 
in i?, thanks to (4.3) and (4.7). The same property carries over to the term 
within the expectation in (4.8) and we thus obtain iii) by letting e — > 0. □ 
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By the factorisation (4.4), we infer that £ T ' A has Fisher information 

3 taW - [T WW - [TA-j Y, {a t^f 

feez JA ^ ' ' 

which is finite thanks to ii) of Lemma 4.3. 
Lemma 4.4. For every $ £ 0, we have 

Jt,a(0) = ITA-MA^EJ^ [(h A (0,X A ) - ( 1 9A)- 1 |X A | - l) 2 ]), (4.10) 
Moreover in the microscopic and intermediate regimes, we have 



ItAtW 



-»• 1 as T^oo. (4.11) 



o/ Lemma 4.. 11. In the course of the proof of Lemma 4.3, we have seen by (4.6) 
that 

3 A (#) = A 2 Ep A [(/i A (^,X A ) + (i9A)- 1 |X A | -l) 2 ]. 
It follows that 

3taW = L^A^JA 2 Ep A [(M^a) + (i?A) _1 |Xa| - if] 
= lta- 1 ja 2 (e^ a [(/ iA (^^A) + (^A)- 1 |X A |) 2 ] 

+ 1 - 2Ep A [h A (#,X A ) + (tfA)" 1 ^!] 



Since £^ ' is regular by Lemma 4.3, we have Ey [5^ log / A (i9, X A )J = 0. 
Combining this with the equality 

d#logf A (ti,X A ) = A(h A (#,X A ) + ($A) _1 |X A | - 1) 

that we obtained in (4.5), we derive 

Ep A [h A (#,x A ) + (^A)- 1 !^!] = 1, 

and (4.10) follows. Expanding (4.10) further, we obtain the useful representation 

j T , A (tf) = L^a- 1 ] (a 2 E^' A [h A ($,X A f] 

+ ^Ej A [\X A \h A (V,X A )) + f " A 2 ). (4.12) 

Let us now assume that A = At — > 0. We will need the following asymptotic 
expansion of the function T v {x) near 0. 
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Lemma 4.5. We have, for v € N, 

Z,(x) = ^^(l + xrv(x)), (4.13) 
where x r u (x) is continuous and satisfies sup !/>0 r u (x) — > w/ien x — > 0. 

Proof of Lemma 4-5. We have an expression olZ u (x) as a power series, thus its 
Taylor expansion in a neighborhood of is given by 

M*) = (f)^( 1 + a:r "K)' 



X ^! /X\ 2 ™ v-^ 1 fX\ 2m „,2/ 9 



where 

r ^ x ) = 4 z ( m + i\u m + i +v \\ (2) - x s^y 

m>0 m>0 

Then x ~> r„(x) is continuous and satisfies sup„ >0 r u (x) — > when x — >• 0. □ 
By Lemma 4.5, a simple Taylor expansion shows that 

where \r' T {d, X A )\ < c(i?), for some deterministic locally bounded c(i?). Plugging 
this last expression in (4.12), we obtain 

M#) = ^ + TA T rU#), 

with r'i, having the same property as ry, whence (4.11) in the microscopic case. 
In the intermediate case, since A ~» E^' A [(h A (#,X A ) + ( 1 9A)" 1 |X A | - 1)'] 
is continuous on (0, 00), we readily obtain the result using that [^A^JA^-, is 
equivalent to TAt as T — > 00. The proof of Lemma 4.4 is complete. □ 



4.2 Proof of Theorems 2.2 and 2.4 



A technically convenient consequence of Lemma 4.4 in the microscopic and 
macroscopic cases is that it suffices to prove Theorems 2.2 and 2.4 with 3t,a t ($) 
instead of It,At($)i provided the convergence (2.4) is valid locally uniformly. 
As £ t>At is the product of £j' T generated by the AjA T — -X7i-i)A T that form 
independent and identically distributed random variables under PV T with 
distribution depending on T, we are in the framework of Theorem 3.1' p. 128 
in Ibragimov and Hasminskii [8] and the LAN property is a consequence of the 
following two conditions: 
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i) For every $o £ @ and h such that i?o + h 6 0, we have 

Iff,,,', g|a|(/(^)-)|^o 

as T — > oo. 

ii) For every /i > and $ G 0, we have 

j TA T (J) e I' At [(^ lo g/(^' X A T )) 2 l{|a^io g /(,?,x A )|>M TiAT w}] ^0 
as T — > oo. 

In the same way as for the proof of ii) in Lemma 4.3, we have 

dJ/ AT (tf, k) = §/a t (0, fc) 1/2 (£( - A T + |fc|<r a + A T hA T (#, k)f 

- \^- 2 + A H I ^jPy ~ hA T (^kf) + % T (tf, k)) 

= lfA T & k ) 1/2 U AT &k), say. 
Therefore, taking squares and summing in k, i) is proved if we show that 

^ TA JaL SU P E I' AT [^a t (^,^a t ) 2 ] ^0 (4.14) 

as T — >• oo. Using (4.3), we have 

< fc AT (<?,fc < 1 and < |! l+2V qA ' < 1, 

hence 7-La{$, X& T ) 2 is less than 

c(0)(A£ + A T (1 + A T )X At + (1 + )xi T + (1 + A t )|X At | 3 + Xi T ) 
for a locally bounded c(#), which in turn is less than 

C '(tf,A T )(A T + A T + xi T +xA T ), (4.15) 



for some (/(•&, At) locally bounded on x [0, oo). Since (XA is a compound 
Poisson process under P^' T with intensity i? and jumps in { — 1, +1} with equal 
probability, the characteristic function of X At is explicitly given by 



e t,At [yuXA,,] = exp ( - 0A T (1 - coshu)), uGR, 
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from which we obtain 

K AT l x l] = $ A t(1 + 30 A T ). (4.16) 

Integrating (4.15), we derive 

4' At [^A T (tf,X AT ) 2 ] < c"(ti, A T ) A T , 

where c" has the same property as d . Since 3t,A t C#o) is of order T as T — > oo 
in both microscopic and intermediate scales, we obtain (4.14) and i) follows. 

It remains to prove ii). Prom the explicit representation 

fylog/A^fc) = A T (-l + h AT (&,k)) + |fc|?T 1 (4.17) 
and the fact that < /ia t ($j fc) < 1, we have 

|a*log/ Ar (0,fc)| < A T + \k\^-\ 

from which we readily obtain 

E^ At [(^log/A T (^,X AT )) 2 ] < </"(0, A T ), 

where d" has the same property as d '. Since 3t,A t (^) — > oo as T — » oo, we 
obtain ii) in both microscopic and intermediate scales. The proof of Theorems 

2.2 and 2.4 is complete. 

4.3 Proof of Theorem 2.5 

The strategy of the proof is quite different from that of Theorems 2.2 and 2.4, 
for we were not able to obtain asymptotic expansions for I u (x) in a viscinity of 
x = oo with appropriate bounds on the stochastic remainders. 

Consider instead the experiment Q t At = {Q^' At , $ G 0} generated by the 
observation of IjTA^ 1 ] independent centred Gaussian random variables with 
variance My, for At — > oo satisfying the rate restriction 

T/A T +l =o((log(T/A r ))-i) (4.18) 

We plan to prove that under the restriction (4.18), the experiments S T ' T and 
Q t At are asymptotically equivalent as T — > oo. Theorem 2.5 then follows from 
the Le Cam theory, see for instance [10]. To that end, we map each increment 
XiA T - ^(i-i)A T in £ TAt with 

y. Ar = XiA T - ^(i_i)A T + Ui 
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where the f7, are independent random variables uniformly distributed on 
[— |, g]- Let us denote by £ t At = {P^' , i? G G} the experiment generated 
by the Y^ T . Since the increments XjA — -X~^_i)a t take values in Z, we have a 
one-to-one correspondence between YJ At and the increment XjA T — Xu—i)A T 
and therefore the experiments £ T '^ T and £ t,At are equivalent. Moreover, £ T <^ T 
and Q T, '^ T live on the same state space m\- t ^ t J and have smooth densities with 
respect to the Lebesgue measure. The proof of Theorem 2.5 is therefore implied 
by the following bound 

||S' At - Ql AT \\tv as T -> oo, 

locally uniformly in # and where 1 1 • 1 1 denotes the variational norm. This 
bound is implied in turn by the bound 

\\C(Yf T )-tf(0,#&T)\\TV = o((T/A T )- 1 ) (4.19) 

locally uniformly in $, since each experiment is the L(r/Aj')~ 1 J-fold product 
independent and identically distributed random variables 8 . Let us further de- 
note by Ptf,A T and q$ : A T the densities of Y 1 T and the Gaussian law J\f(0, Mr) 
respectively. We have 

\\£{Y l AT )-Af(0,#A T )\\ TV = [ \P*M X ) ~ Q*M x )\<te 

< 7 + 77 + III, 

where, applying sucessively the triangle inequality and Cauchy-Schwarz, for any 

T] >0, 




/2 



II = F$ T (\X At + U 1 \> V ), 
111= I qtf,A T {x)dx. 

J\x\>t] 

Set r\ = t}t = k^At log(T/Ay). We claim that for k 2 > 2$, the terms 7, 77 
and 777 are o((T/A^ 1 )) hence (4.19) and the result, for an appropriate choice 

8 For instance, by using the bound 

|| P ®« „q®« || TV < ^2(1 - (1 - HI P-Q ||tv)") 1/2 . 



24 



of k so that the convergence can hold locally uniformly in d. Since q 1 ? ; a t (ar) is 
the density of the Gaussian law jV(0, Mr), we readily obtain 



,2 



/// < 2exp ( - = (T/A T )- 2 /^) = o((T/A^)) 

using k 2 > 2$. For the term II, we observe that since \Ui\ < 1/2, we have 

^ T {\X At + U X \ > V t) < ^ T (| £ *\ ^ - |), 

i=i 

where the £j G { — 1, 1} are independent and symmetric. By Hoeffding inequality, 
this term is further bounded by 

2E?. ^ [exp ( - fcif )] < 2(exp ( - ^f) + pj^ (iV AT > «'A T )) 
for every «/ > 0. If re' < k 2 /2, one readily checks that 

ew(- ( -^f)=o((T/A T )-i). 
Moreover, if k' > i?, we have, by Chernov inequality, 

(N At - i?A T > («' - 0)A r ) < exp ( - A t (k' log(it' /#) - («' - 0))) 



nT, A 



and this term is also o((T/A^) _1 ). Thus 77 and III have the right order and 
it remains to bound the main term /. By Plancherel equality we obtain the 
following explicit expression: 

/ {p4,At( x ) ~ <l#A T (x)) 2 dx = {2tt)- 1 / (ptf,A T (0-*VVr(0) 2d £ 
JR JR 

= (2ir)- 1 [ (e-^-^^i-e-^^YdC 
Jr^ | > 



< /y + V + W, 
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with 



IV = (2TT)- 1 

V = 2(2tt)- 1 



-*A t (i-oob(-^)) Sin (^fc 



|5|> Pv ^ 



e 2 



1^2 <% 



-2i?A T (l-cos(-^=)) / sin ( 



2 V / S7 



/At' 



2VAt 



/A) 



y/ = 2(2^)- x / 

J\£\ 



for any p > 0. By a first order expansion, we have that IV is less than 



L 



+ 



A 2 T 



A T _|_ 



A 7 



/A? 



for some bounded function £ —» a(£). Set a* = sup^. |a(x)|. We thus obtain that 
IV is less than a constant times 



*n2 -(#-2pa*)£ 



We 



2 d£ 



/A, 



If we pick p such that > 2 pa, the term IF is of order A T 



5/2 



. For the term 



V 



L 



e -2^A T (l-cosO / Slri (| ) \ 2 
- 2 



noting that (sinx) 2 = (l — cos(2x))/2, we bound the 27r-periodic, even and 
continuous function £ ~~» e -2i?A T (l-cos^)^ — cos£) by its supremum (4ei?AT) _1 . 
The integr ability of £ -2 away from enables to conclude that V is of order 
A^ 1 . Finally, by Gaussian approximation, we readily obtain that VI is of order 



A -l/2 e -p 2 tfA T _ 



In conclusion, we have that L (p#,A T (x) — q-d,& T {x)) 2 dx is dominated by 

— 1 1/2 —1/2 

the term V and is thus of order A T . It follows that / is of order riJ A^ 



and the choice tjt = K\/ At log(T/ At) implies / = o((T/At) x ) thanks to the 



restriction condition T/A T 
is complete. 



o((log(T/AT)) 4 ). The proof of Theorem 2.5 
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4.4 Proof of Theorem 2.6 



Set 

. {XiA T - X (i-i)A T ) -VAt 

Qi,T ~ 



(LTA T 1 Ji?A T (l + 2 tfA T )) 1/2 ' 

Under P^' T , the variables £i t x are independent, identically distributed, and we 
have 

LTA- 1 ] 

4' At M = and ]T Var[fc, r ] = 1 

i=l 

by (4.16). Moreover, for every 5 > 0, 

E ^' AT [l^| 2l {fe,T|>5}] ^0 as T^oo, 

i=l 

I TA -1 1 

therefore, by the central limit theorem Ut = Yli=i T £i,T J* (0, 1) i n distri- 
bution under PV T as T — > oo in all three regimes (microscopic, intermediate 
and macroscopic). We thus obtain the following representation 

$t V a t = T^At \TA T l \ + T- 1 ( LTA T X J A T 0(1 + 2tf A T )) V V T 
and the result follows from T^A^L^A^ 1 ] ~ ^ anc ^ 

T- 2 LTA T 1 JA T? ?(1 + 2tfA T ) ~ /r,o(^)- 1 +/ r ,oo(i3)" 1 
as T — >• oo in all three regimes. 



4.5 Proof of Theorem 2.7 

By (4.11) of Lemma 4.4, it suffices to prove 

hminf Or Tl A r W(iT,o(i?)" 1 + iT I oo(i?)" 1 ) > 1. (4-20) 

Up to taking a subsequence, we may assume that A^ — > A G (0,1/(4??)] as 
T — )• oo. Using (4.12) and Theorem 2.6, for every ■& £ O, we have 

3t,At (0)(It,o(0)- 1 + ItM^)' 1 ) 
~ (MEf [/ia(^,^a) 2 ] +2E^' A [|Xa|^a(^X a )] + 1 - 0A) (2tfA + 1) 
= : Al(i?A) as T -> oo, 
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where A4 is a univariate function since PV has density /a($;^) = 
e~ X[ fc [(i9A) with respect to the counting measure on Z. Therefore, Theorem 
2.7 is equivalent to proving that 

M(x) > 1 for every x G (0, |]. (4.21) 

Using (4.2) of Lemma 4.1 we have 

=1 - 4??A + (1 + 40A) E^' A [/ia(^,^a) 2 ] +4E^' A [|Xa|M#,*a)] 
+ 2(l + 2tfA)E^' A [(|X A |+^A/iA(^XA))%^(^XA)] 

where 

5^a(^, fc) = + fc) " ^ fc)2 

is positive (see Theorem 1 of Baricz [1]) and /i(i9, k) is in [0, 1] according to (4.3) 
of Lemma 4.1. We derive 

d x M(x) > 1 - Ax > for x £ (0, ij , 

hence (4.21). Since .A/f (x) — )• 1 as x — )• 0, the conclusion follows. 
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